378 research outputs found
Current Time Series Anomaly Detection Benchmarks are Flawed and are Creating the Illusion of Progress
Time series anomaly detection has been a perennially important topic in data
science, with papers dating back to the 1950s. However, in recent years there
has been an explosion of interest in this topic, much of it driven by the
success of deep learning in other domains and for other time series tasks. Most
of these papers test on one or more of a handful of popular benchmark datasets,
created by Yahoo, Numenta, NASA, etc. In this work we make a surprising claim.
The majority of the individual exemplars in these datasets suffer from one or
more of four flaws. Because of these four flaws, we believe that many published
comparisons of anomaly detection algorithms may be unreliable, and more
importantly, much of the apparent progress in recent years may be illusionary.
In addition to demonstrating these claims, with this paper we introduce the UCR
Time Series Anomaly Archive. We believe that this resource will perform a
similar role as the UCR Time Series Classification Archive, by providing the
community with a benchmark that allows meaningful comparisons between
approaches and a meaningful gauge of overall progress
FastDTW is approximate and Generally Slower than the Algorithm it Approximates
Many time series data mining problems can be solved with repeated use of
distance measure. Examples of such tasks include similarity search, clustering,
classification, anomaly detection and segmentation. For over two decades it has
been known that the Dynamic Time Warping (DTW) distance measure is the best
measure to use for most tasks, in most domains. Because the classic DTW
algorithm has quadratic time complexity, many ideas have been introduced to
reduce its amortized time, or to quickly approximate it. One of the most cited
approximate approaches is FastDTW. The FastDTW algorithm has well over a
thousand citations and has been explicitly used in several hundred research
efforts. In this work, we make a surprising claim. In any realistic data mining
application, the approximate FastDTW is much slower than the exact DTW. This
fact clearly has implications for the community that uses this algorithm:
allowing it to address much larger datasets, get exact results, and do so in
less time
Temporal and spatial dynamics of scaling-specific features of a gene regulatory network in Drosophila
A widely appreciated aspect of developmental robustness is pattern formation in proportion to size. But how such scaling features emerge dynamically remains poorly understood. Here we generate a data set of the expression profiles of six gap genes in Drosophila melanogasterembryos that differ significantly in size. Expression patterns exhibit size-dependent dynamics both spatially and temporally. We uncover a dynamic emergence of under-scaling in the posterior, accompanied by reduced expression levels of gap genes near the middle of large embryos. Simulation results show that a size-dependent Bicoid gradient input can lead to reduced Krüppel expression that can have long-range and dynamic effects on gap gene expression in the posterior. Thus, for emergence of scaled patterns, the entire embryo may be viewed as a single unified dynamic system where maternally derived size-dependent information interpreted locally can be propagated in space and time as governed by the dynamics of a gene regulatory network
Prognostic nomogram for bladder cancer with brain metastases: a National Cancer Database analysis.
BACKGROUND: This study aimed to establish and validate a nomogram for predicting brain metastasis in patients with bladder cancer (BCa) and assess various treatment modalities using a primary cohort comprising 234 patients with clinicopathologically-confirmed BCa from 2004 to 2015 in the National Cancer Database.
METHODS: Machine learning method and Cox model were used for nomogram construction. For BCa patients with brain metastasis, surgery of the primary site, chemotherapy, radiation therapy, palliative care, brain confinement of metastatic sites, and the Charlson/Deyo Score were predictive features identified for building the nomogram.
RESULTS: For the original 169 patients considered in the model, the areas under the receiver operating characteristic curve (AUC) were 0.823 (95% CI 0.758-0.889, P \u3c 0.001) and 0.854 (95% CI 0.785-0.924, P \u3c 0.001) for 0.5- and 1-year overall survival respectively. In the validation cohort, the nomogram displayed similar AUCs of 0.838 (95% CI 0.738-0.937, P \u3c 0.001) and 0.809 (95% CI 0.680-0.939, P \u3c 0.001), respectively. The high and low risk groups had median survivals of 1.91 and 5.09 months for the training cohort and 1.68 and 8.05 months for the validation set, respectively (both P \u3c 0.0001).
CONCLUSIONS: Our prognostic nomogram provides a useful tool for overall survival prediction as well as assessing the risk and optimal treatment for BCa patients with brain metastasis
Specialized Re-Ranking: A Novel Retrieval-Verification Framework for Cloth Changing Person Re-Identification
Cloth changing person re-identification(Re-ID) can work under more
complicated scenarios with higher security than normal Re-ID and biometric
techniques and is therefore extremely valuable in applications. Meanwhile,
higher flexibility in appearance always leads to more similar-looking confusing
images, which is the weakness of the widely used retrieval methods. In this
work, we shed light on how to handle these similar images. Specifically, we
propose a novel retrieval-verification framework. Given an image, the retrieval
module can search for similar images quickly. Our proposed verification network
will then compare the input image and the candidate images by contrasting those
local details and give a similarity score. An innovative ranking strategy is
also introduced to take a good balance between retrieval and verification
results. Comprehensive experiments are conducted to show the effectiveness of
our framework and its capability in improving the state-of-the-art methods
remarkably on both synthetic and realistic datasets.Comment: Accepted by Pattern Recognitio
Revealing the link between evolution of electron transfer capacity of humic acid and key enzyme activities during anaerobic digestion
Humic acid (HA) is an important active compound formed during anaerobic digestion process, with a complex structure and dynamic electron transfer capacity (ETC). However, the mechanisms by which these macromolecular organic compounds dynamically interact with the microbial anaerobic digestion process at different operating temperatures are still unclear. In this study, the link between the evolution of the ETC of HAs and the microbial community under mesophilic and thermophilic conditions was investigated. The results showed an increasing trend in the ETC of HAs in both mesophilic (671–1479 μmol gHA−1) and thermophilic (774–1506 μmol gHA−1) anaerobic digestion (AD) until day 25. The ETC was positively correlated with the bacterial community of hydrolytic and acidogenic phases, but negatively correlated with the archaeal community of the methanogenic phase. Furthermore, the relationship between ETC and key enzyme activity was explored using a co-occurrence network analysis. HAs revealed a high potential to promote key enzyme activities during hydrolysis (amylase and protease) and acidification (acetate kinase, butyrate kinase, and phosphotransacetylase) while inhibiting the key enzyme activity in the methanogenic phase during the anaerobic digestion process. Moreover, HAs formed under thermophilic conditions had a greater influence on key enzyme activities than those formed under mesophilic conditions. This study advances our understanding of the mechanisms underlying the influence of HAs on anaerobic digestion performance
- …